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Amendments to the Claims: 

This listing of claims will replace all prior versions and listings of claims in the application: 
Listing of Claims: 

Claim 1 (previously submitted): A method for searching a collection of items, wherein each item 
in the collection has a set of properties, comprising the steps of: 

obtaining a query composed of a first set of one or more properties; and 

obtaining a result based on applying a distance function to the query and an item in the 
collection having a second set of one or more properties, wherein 

obtaining a result includes determining a third set of properties common to the 
first set of one or more properties and the second set of one or more properties, and 

the distance function determines a distance between the query and an item in the 
collection based on the number of items in the collection that are associated with all of the 
properties in the third set of properties. 

Claim 2 (original): The method of claim 1, further including the step of associating each item in 
the collection with a set of properties. 

Claim 3 (previously submitted): The method of claim 1, wherein the step of obtaining a result 
includes identifying one or more result items whose distance from the query is within a first 
threshold. 

Claim 4 (previously submitted): The method of claim 3, wherein the step of obtaining a result 
includes ranking the one or more result items according to their distance from the query. 

Claim 5 (original): The method of claim 3, wherein the threshold is defined as a number of 
result items. 

Claim 6 (original): The method of claim 3, wherein the threshold is defined as a distance. 

Claim 7 (original): The method of claim 1, further including the step of returning the result. 

Claim 8 (original): The method of claim 1, wherein the step of obtaining a query includes the 
step of mapping a received query to a set of one or more properties. 
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Claim 9 (original): The method of claim 1, wherein one or more of the properties are binary. 

Claim 10 (original): The method of claim 1, wherein one or more of the properties are related by 
a partial order, and wherein, if an item is associated with a property, then the item is also 
associated with all ancestors of that property in the partial order. 

Claim 1 1 (previously submitted): The method of claim 10, wherein one or more of the 
properties represent numerical values or ranges, and wherein the partial order reflects a set of 
containment relationships among the numerical values or ranges. 

Claim 12 (original): The method of claim 1, wherein the properties are grouped into equivalence 
classes. 

Claim 13 (original): The method of claim 12, further including the step of grouping the 
properties into equivalence classes using clustering. 

Claim 14 (original): The method of claim 13, wherein each property has a set of subproperties, 
wherein the clustering is performed such that the distance between two properties in the 
collection is correlated to the number of properties in the collection that are associated with all of 
the subproperties common to both properties. 

Claim 15 (original): The method of claim 1, wherein the query corresponds to a single item in 
the collection. 

Claim 16 (original): The method of claim 1, wherein the query corresponds to a plurality of 
items in the collection. 

Claim 17 (original): The method of claim 1, wherein the query is independent of the items in the 
collection. 

Claim 18 (original): The method of claim 1, wherein the step of obtaining a result is constrained 
to a subcollection of the items in the collection. 

Claim 19 (original): The method of claim 18, wherein the subcollection is specified as an 
expression of properties. 

Claim 20 (original): The method of claim 19, wherein the expression includes a subset of the set 
of properties that compose the query. 
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Claim 21 (original): The method of claim 1, wherein the step of obtaining a query includes 
identifying certain properties to be ignored in the step of obtaining a result. 

Claim 22 (original): The method of claim 1, wherein the distance function is applied explicitly. 

Claim 23 (original): The method of claim 1, wherein the distance function is applied implicitly. 

Claim 24 (original): The method of claim 23, wherein the step of obtaining a result includes the 
step of iterating a random walk process to select potential result items. 

Claim 25 (original): The method of claim 24, wherein the step of obtaining a result includes 
ranking the potential result items by frequency and selecting the potential result items having 
higher frequencies. 

Claim 26 (original): The method of claim 23, wherein the step of obtaining a result includes 
iterating through one or more subsets of the query and identifying items associated with the one 
or more subsets. 

Claim 27 (original): The method of claim 26, wherein the one or more subsets are prioritized 
according to the number of items in the collection that have all of the properties in each subset 
and wherein iterating through one or more subsets of the query is continued until a first threshold 
is reached. 

Claim 28 (original): The method of claim 1, wherein the step of obtaining a result includes 
applying a Euclidean distance function. 

Claim 29 (original): The method of claim 28, wherein the step of obtaining a result includes 
merging a first result determined by applying the distance function and a second result 
determined by applying the Euclidean distance function. 

Claim 30 (original): The method of claim 28, wherein the step of obtaining a result includes 
determining a first result by applying either the distance function or the Euclidean distance 
function and applying the other distance function to the first result. 

Claim 31 (previously submitted): A method for analyzing two sets of properties from a plurality 
of sets of properties, comprising the steps of: 

determining a set of properties common tojhe two sets of properties; 
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determining the number of sets of properties from the plurality of sets of properties that 
include the set of common properties; and 

assessing the distance between the two sets of properties as a function of the number of 
sets of properties that include the set of common properties. 

Claim 32 (currently amended): A method for analyzing the relationship between iwe -first and 
second items in a collection of items, wherein each item in the collection is associated with a set 
of properties, the first item having a first set of properties and the second item having a second 
set of properties, comprising the steps of: 

obtaining a third set of properties with which the two items ar e commonly 
as s ociated common to the first set of properties and the second set of properties ; and 

determining the degree of commonality between the two items first item and the second 
item as a function of the number of items in the collection that are associated with all of the 
properties with which the two items are commonly associated in the third set of properties . 

Claim 33 (previously submitted): A computer program product, residing on a computer readable 
medium, for use in searching a collection of items, the computer program product comprising 
instructions for causing a computer to: 

receive a query composed of one or more properties; and 

obtain a result based on applying a distance function to the query and an item in the 
collection having a second set of one or more properties, 

wherein the distance function determines a third set of properties common to the first set 
of one or more properties and the second set of one or more properties, and determines a distance 
between the query and an item in the collection based on the number of items in the collection 
that are associated with all of the properties in the third set of properties. 

Claim 34 (original): The computer program product of claim 33, wherein the instructions cause 
the computer to obtain a result by identifying exactly the items whose distance from the query is 
within a threshold. 



BOSTON 1 960904v 1 



5 of 11 



Appl.No.: 10/027,195 
Reply Dated: ' August 3, 2004 
Office Action of: May 2 1 , 2004 



Atty. Docket No. 109878.125 



Claim 35 (original): The computer program product of claim 33, wherein the instructions cause 
the computer to obtain a result by identifying approximately the items whose distance from the 
query is within a threshold according to a heuristic. 

Claim 36 (original): The computer program product of claim 35, wherein the heuristic permits a 
trade-off between the accuracy and the performance of a search. 

Claim 37 (original): The computer program product of claim 35, wherein the heuristic includes 
the use of a random walk process. 

Claim 38 (previously submitted): A computer system for managing data records comprising: 

an information retrieval subsystem that stores and retrieves data records, each data record 
being associated with a set of properties; and 

a similarity search subsystem that receives similarity search queries and processes 
similarity search queries based on a distance function, a similarity search query being associated 
with a first set of properties, 

wherein the distance function determines a distance between the query and a data record 
in the collection having a second set of properties based on determining a third set of properties 
common to the first set of properties and the second set of properties, and determining the 
number of data records in the collection that are associated with all of the properties in the third 
set of properties. 

Claim 39 (original): The computer system of claim 38, further including a clustering subsystem 
that employs the distance function of the similarity search subsystem to construct a graph. 

Claims 40-41 (cancelled). 
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